Analyzing the Impact of UMLS Relations on Word-sense Disambiguation Accuracy
نویسندگان
چکیده
Word-sense disambiguation (WSD) is the process of finding the correct meaning of words that have multiple meanings. The unsupervised WSD algorithm is the type of WSD algorithm that leverages an external source of knowledge to guide the disambiguation process. The unsupervised WSD algorithm type is attracting more interest in the biomedical domain because of its implementation practicality, especially when it leverages the knowledge sources of the Unified Medical Language System (UMLS), but still the resulted accuracy of the unsupervised WSD algorithm is lower than its supervised alternative. In this study we analyze the impact of using different subsets of the UMLS on the resulted accuracy of the unsupervised WSD algorithm. Our findings show that there are better ways to leverage the UMLS than using it as a monolithic source of knowledge. © 2011 Published by Elsevier Ltd. Selection and/or peer-review under responsibility of [name organizer]
منابع مشابه
Unsupervised Monolingual and Bilingual Word-Sense Disambiguation of Medical Documents using UMLS
This paper describes techniques for unsupervised word sense disambiguation of English and German medical documents using UMLS. We present both monolingual techniques which rely only on the structure of UMLS, and bilingual techniques which also rely on the availability of parallel corpora. The best results are obtained using relations between terms given by UMLS, a method which achieves 74% prec...
متن کاملWord embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation
Word sense disambiguation helps identifying the proper sense of ambiguous words in text. With large terminologies such as the UMLS Metathesaurus ambiguities appear and highly effective disambiguation methods are required. Supervised learning algorithm methods are used as one of the approaches to perform disambiguation. Features extracted from the context of an ambiguous word are used to identif...
متن کاملImproving Summarization of Biomedical Documents Using Word Sense Disambiguation
We describe a concept-based summarization system for biomedical documents and show that its performance can be improved using Word Sense Disambiguation. The system represents the documents as graphs formed from concepts and relations from the UMLS. A degree-based clustering algorithm is applied to these graphs to discover different themes or topics within the document. To create the graphs, the...
متن کاملResearch and applications: Word sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge-poor unsupervised methods
OBJECTIVE To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graph-based approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to the topic-modeling techniques that use domain-specific knowledge sources. MATERIALS AND METHOD...
متن کاملWord sense disambiguation in the clinical domain: a comparison of knowledge-rich and knowledge- poor unsupervised methods
To cite: Chasin R, Rumshisky A, Uzuner O, et al. J Am Med Inform Assoc 2014;21:842–849. ABSTRACT Objective To evaluate state-of-the-art unsupervised methods on the word sense disambiguation (WSD) task in the clinical domain. In particular, to compare graphbased approaches relying on a clinical knowledge base with bottom-up topic-modeling-based approaches. We investigate several enhancements to ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013